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Abstract 

We propose a simple algorithm which produces a new category of networks, high 
dimensional random Apollonian networks, with small- world and scale- free charac- 
teristics. We derive analytical expressions for their degree distributions and cluster- 
ing coefficients which are determined by the dimension of the network. The values 
obtained for these parameters are in good agreement with simulation results and 
comparable to those coming from real networks. We estimate also analytically that 
the average path length of the networks increases at most logarithmically with the 
number of vertices. 
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1 Introduction 



Since the pioneering papers by Watts and Strogatz on small-world networks 
[1] and Barabasi and Albert on scale- free networks [2], complex networks have 
received considerable attention as an interdisciplinary subject [3,4,5]. Complex 
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networks describe many systems in nature and society, such as Internet [6], 
World Wide Web [7], metabolic networks [8], protein networks in the cell 
[9], co-author networks [10] and sexual networks [11], most of which share 
three apparent features: power-law degree distribution, small average path 
length (APL) and high clustering coefficient. In recent years, many evolving 
models [3,4,5] have been proposed to describe real-life networks. The origi- 
nal BA model [2,12] captures the two main mechanisms responsible for the 
power-law degree distribution of degrees, namely growth and preferential at- 
tachment. Dorogovtsev, Mendes, and Samukhin [13] gave an exact solution for 
a class of growing network models thanks to the use of a "master-equation" . 
Krapivsky, Redner, and Leyvraz [14] examined the effect of a nonlinear pref- 
erential attachment on network dynamics and topology. Amaral et al. [15] 
studied models that incorporate aging and cost and capacity constraints in 
order to explain the deviations from the power-law behavior in several real- 
life networks. Dorogovtsev and Mendes [16] also addressed the evolution of 
networks with aging of sites. Bianconi and Barabasi [17] offered a model ad- 
dressing the competitive aspect in many real networks such as World Wide 
Web. Additionally, in real systems a series of microscopic events shape the 
network evolution, including the addition or rewiring of new edges or the re- 
moval of vertices or edges. Albert and Barabasi [18] discussed a model that 
incorporates new edges between existing vertices and the rewiring of edges. 
Dorogovtsev and Mendes [19] considered a class of undirected models in which 
new edges are added between old vertices and existing edges can be removed. 
Although it is now established that preferential attachment can explain the 
power-law characteristic of networks, there is a wide range of microscopic al- 
ternative mechanisms that could affect the evolution of growing networks and 
still lead to the observed scale-free topologies. Kleinberg et al. [20] and Kumar 
et al. [21,22] proposed copying mechanisms motivated by the desire to explain 
the power-law degree distribution of the World Wide Web. Chung et al. [23] 
introduced also a duplication model for biological networks. Krapivsky and 
Render [24] presented edge redirection mechanisms which are mathematically 
equivalent to the model of Kumar et al [21,22]. Inspired by citation networks 
Vazquez proposed in [25] the walking mechanism. Actually scale-free net- 
works can be created by various methods. Cornelias, Fertin and Raspaud [26] 
introduced a category of graphs via a recursive construction. Specific recur- 
sive and general scale-free constructions are given also in [12,27,28,29,30] and 
scale-free trees (without clustering) in [31]. Zhou et al. construct in [32] an 
integer network using a pure mathematical method. 

In relation to the problem of Apollonian packing, a two-dimensional example 
of which is shown in Fig. 1, Andrade et al. introduced Apollonian networks [33] 
which were also proposed by Doye and Massen in [34]. It should be pointed 
out that high-dimensional Apollonian networks were already introduced in 
[34], but in this work the emphasis is placed on two-dimensional Apollonian 
networks and to provide a model to help understanding the energy landscape 
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Fig. 1. An Apollonian packing of disks within a circle. 

networks. Zhang et al. in [35] offer a simple general algorithm producing high- 
dimensional Apollonian networks and derive in detail analytical expressions 
for their order, size, degree distribution, clustering coefficient and diameter. 
Inspired also by the Apollonian packing, Zhou et al. [36] proposed a simple 
rule that generates random two-dimensional Apollonian networks with very 
large clustering coefficient and very small APL. 

In this paper, motivated by high-dimensional Apollonian packings, we present 
a simple iterative algorithm for high-dimensional random Apollonian networks 
(HDRAN) which extends the idea introduced in [36] for dimension two to 
any dimension. The algorithm can concretize the problems of abstract high- 
dimensional random Apollonian packings. Using the algorithm we determine 
relevant characteristics of HDRAN: scale-free degree distribution and large 
clustering coefficient which depend on the dimension of the Apollonian pack- 
ing. Based on the algorithm, we estimate also that the average path length of 
the networks increases logarithmically with the number of vertices. Therefore, 
these networks (HDRAN) have a small-world scale-free topology. 



2 The construction of high-dimensional random Apollonian net- 
works 

The name of Apollonian packings goes back to Apollonius of Perga (c. 262 
BC- c. 190 BC) who studied the problem of finding tangent circles to any 
three given circles. When the initial three circles are mutually tangent, there 
exist only two of such tangent circles, known as Apollonian circles or Soddy 
inner and outer circles. Frederick Soddy [37], Nobel Prize for Chemistry in 
1921, published in Nature a poem describing the equation which relates the 
radii of the circles and is now known as the "Kissing Circles Theorem" . This 
result, however, had been previously given by Rene Descartes in 1643 and 
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rediscovered in 1842 by the mathematician Phillip Beecroftan. A generaliza- 
tion to n-dimensional hyperspheres, also as a poem, was published as well in 
Nature by Thorold Gosset [38] and reads: 



And let us not confine our cares 

To simple circles, planes and spheres, 

But rise to hyper flats and bends 

Where kissing multiple appears, 

In n-ic space the kissing pairs 

Are hyperspheres, and Truth declares - 

As n + 2 such osculate 

Each with an n + 1 fold mate 

The square of the sum of all the bends 

Is n times the sum of their squares. 



In the procedure known as Apollonian packing or Apollonian gasket, we start 
with three mutually tangent circles, and draw their inner Soddy circles. Then 
we draw the inner Soddy circles of this circle with each pair of the original 
three, and the process is iterated, see Fig.l. This Apollonian packing can be 
used to design a network, where each circle is associated to a vertex of the 
network and vertices are connected if the corresponding circles are tangent. 
We call this network a two-dimensional Apollonian network [33,34]. 

In a similar way, it is possible to construct higher dimensional (ci-dimensional 
al > 2) Apollonian networks associated with other self-similar packings. We 
start with d+1 mutually tangent ci-dimensional hyperspheres (d- hyperspheres) 
in a d- dimensional simplex which is enclosed and tangent to a larger hyper- 
sphere, the interstices of which are curvilinear ci-dimensional simplices (ci- 
simplices). In the first iteration, and inside each of the interstices, we add 
ci-hyperspheres tangent to each of the d+1 ci-hyperspheres bounding the 
curvilinear ci-simplex. The added hyperspheres will not fill the interstice and 
each produces d+1 smaller interstices. Then, in the second iteration, d+1 
ci-hyperspheres are added inside all of the d+1 new interstices, being again 
tangent to the enclosing ci-hyperspheres. The process is repeated iteratively 
obtaining the high-dimensional Apollonian packing. If each ci-hyper sphere cor- 
responds to a vertex and vertices are connected by an edge if the corresponding 
ci-hyperspheres are tangent, then we obtain a ci-dimensional Apollonian net- 
work. 

In the above ci-dimensional Apollonian packings, if we add only one ci-hypersphere 
inside a randomly selected interstice in every iteration, after a certain number 
of iterations we have a high-dimensional random Apollonian packing. Anal- 
ogously, if each ci-hypersphere corresponds to a vertex and vertices are con- 
nected by an edge if the corresponding ci-hyperspheres are tangent, then one 
obtains ci-dimensional random Apollonian networks. In [36] it is discussed the 
special case ci = 2. Here we will consider the general case. It should be noted 
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that we consider the initial bounding <i-hypersphere at the same level as the 
other d + 1 initial d-hyperspheres. 



The iterative algorithm for high-dimensional random Apollonian 
networks 



In the iterative process for the construction of HDRAN, at each step and for 
each new hypersphere added, d + 1 new interstices are created in the packing, 
that will be filled in one of the following iterations. Equivalently, for each new 
vertex added, d + 1 new rf-simplices are created in the network, into which 
vertices will be inserted in one of the following iterations. According to this 
process, we introduce a general iterative algorithm for HDRAN. We denote the 
ci-dimensional random Apollonian network after t iterations by A(d, t), d > 2. 
Before introducing the algorithm we give the following definition. A complete 
graph Kd (also referred in the literature as ci-clique; see [40]) is a graph without 
loops whose d vertices are pairwise adjacent. Then the ci-dimensional random 
Apollonian network A(d, t) is constructed as follows. Initially (t — 0), A(d, 0) is 
the complete graph K^ +2 or (d+ 2)-clique. At each step, we choose an existing 
subgraph isomorphic to a (cZ+l)-clique that has never been selected before, then 
we add a new vertex and join it to all the vertices of the selected subgraph. 
The growing process is repeated until the network reaches the desired size. 
Note that when d = 2, our model gives the maximal planar network of [39]. 
Furthermore, since the network size is incremented by one with each step, in 
this paper, we use the step value t to represent a vertex created at this step. 
We can see easily that at step t, the network consists of N = t + d + 2 vertices. 
The total degree equals (d + 1) (2t + d + 2). So, when t is large the average 
vertex degree at step t is equal approximately to a constant value 2{d + 1), 
which shows that the network A(d, t) is sparse like many real-life networks 
[3,4,5]. 



Relevant characteristics of high-dimensional random Apollonian 
networks 



In this section we show that the dimension d is a tunable parameter which 
controls the relevant characteristics of a high dimensional random Apollonian 
network A(d, t). 
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4-1 Degree distribution 



The degree distribution is one of the most important statistical characteristics 
of a network. We use the mean-field method [3,41] to predict the growth 
dynamics of the individual vertices, and calculate analytically the connectivity 
distribution P(k) and the scaling exponents. Note that, after a new vertex is 
added, the number of (d + l)-cliques that can be chosen in the following 
step increases by d. Therefore, we can immediately see that after t steps, the 
number of (d + l)-cliques ready for selection are dt + d + 2. Given a vertex, 
when its degree increases by one, the number of (d + l)-cliques that contain 
the vertex increases by d — 1, so the number of (d + l)-cliques for selection 
containing the vertex i with degree fcj is (d — l)ki — d 2 + d + 2. If we consider 
k continuous, we can write for a vertex i 

dk = (d-l)k t -d 2 + d + 2 

dt dt + d + 2 U 



The solution of this equation, with the initial condition that vertex i was 
added to the network at tj with connectivity ki(ti) — d + 1, is 

j / \ d 2 -d-2 d+1 f dt + d + 2\^ 



The probability that a vertex has a connectivity ki(t) smaller than k, P{ki(t) < 
k), is 



P{h{t) <k) = P 



U > 



(dt + d + 2)(^) A d + 2 ) 



{ d( k -^y- 



(3) 



Assuming that we add the vertices to the network at equal intervals, the 
probability density of U is 



Substituting this into Eq. (3) we obtain that 



U > 



(dt + d + 2) 



d 
d-1 



d + 2 



-d-2\ d-1 d 



(5) 
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= 1 



u < 



(dt + d + 2)(^) d - 1 d + 2 



\ 



1 - 



(dt + d + 2) 



— d 



{d + 2 + t)d(k 



d + 2 

d 2 -d-2 \dh (d + 2 + t)d 
d-i 



Thus the degree distribution is 

p[k) = apjkm < t) = (dt+ rf+2)(rf+i)A ((<j _ i)jt _ {d2 _ d _ 2)) - (6) 

For large t 

, l-2d 

P(ife) = d(d + 1)3=t ((^ _ -d- 2)) (7) 

and if 3> d then P(k) ~ /c~ 7 giving a degree exponent 7(d) = ^5f • 

This value is different from the degree exponent for a high dimensional Apol- 
lonian network (non random HDAN) which is 7 = 1 + lo ^^ 1 - > [34,35]. This 
discrepancy would need further study, but a possible explanation is that for 
two networks with the same number of vertices, HDRAN and HDAN, the 
distribution of degrees differs because of the different growing process. The 
method used to obtain P(k) might have also some influence in the value of 7 
as it is explained at the end of the next subsection. 

When d = 2, one has 7(2) = 3 [36], while as d goes to infinity 7(00) = 2. 
Thus by tuning the parameter d, it is possible to obtain a variety of scale-free 
networks with different exponents in the range, 2 < 7 < 3. In Fig. 2, the 
degree distributions P(k) at various values of the dimension d are displayed 
and we find that at any value of d, the simulated degree distribution is in good 
agreement with the analytic value. 

Also, from Eqs. (2) and (7), we can notice that when t becomes large, the 
maximal degree of a vertex is roughly JV 1 ^ 7-1 ). 



4-2 Clustering coefficient 



The clustering coefficient of a vertex is the ratio of the total number of existing 
connections between all its k nearest neighbors and k(k — l)/2, the number 
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Fig. 2. Log-log graph of the cumulative degree distribution of HDRAN with order 
N = 10000 at various values of the dimension d. The squares (a), stars (b), circles 
(c), and triangles (d) denote the simulation results for d = 2, d = 3, d = 4, and 
d = 5, respectively. The reference dotted lines come from the evaluation of 7(d) as 
given by equation (7). 

of all possible connections between them. The clustering of the graph is ob- 
tained averaging over all its vertices. We can derive analytical expressions for 
the clustering C(k) for any vertex with degree k. When a vertex is created it 
is connected to all the vertices of a (d+ l)-clique whose vertices are completely 
interconnected. It follows that a vertex with degree k = d + 1 has a clustering 
coefficient of one because all the (d+l)d/2 possible links between its neighbors 
actually exist. In the following steps, a newly-created vertex links the consid- 
ered vertex whose d nearest neighbors are also connected by the new one at 
the same iteration step. Thus there is a one-to-one correspondence between 
the degree of a vertex and its clustering. For a vertex v of degree k, the exact 
expression for its clustering coefficient is 



C(k) 



^±H + d(k-d-l) 

k(k-l) 
2 



d(2k -d-V 
k{k-l) 



which depends on the degree k and the dimension d. This expression indicates 
that the local clustering scales as C(k) ~ k~ x for large k. Note that, as the 
clustering only depends on k for a given dimension, it matches the results 
found in [34] for HDAN. It is interesting to notice that a similar scaling has 
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Fig. 3. Clustering coefficient C of HDRAN at various values of dimension d. The 
squares, stars, circles and triangles denote d = 2, d = 3, d = A and d = 5, respec- 
tively. The dashed lines are fits to the simulated data. 

been observed in several real-life networks 



The clustering coefficient C of HDRAN can be obtained as the mean value of 
C{k) with respect to the degree distribution P{k) expressed by Eq. (7). The 
result is 



C= J C(k)P{k)dk = 

d+l 

- J d2{2k ~ ii k + 1)A « rf - * - ^ - 2 » w dk < 9 > 

For d = 2, equation (9) gives C = y — 36 In | = 0.7366, which is the same result 
obtained in [36]. For d = 3 we obtain C = 18+36\/2 arctan v2+§7r— 18^/2^ = 
0.8021. For d = 4 the analytical value is 0.8404 and for d = 5 it is 0.8658. 
We note that they do not depend on the order of the network. Note also that 
for d = 2 the value obtained is not far from the value of the clustering in 
the coauthorship network in neuroscience (0.76) [3]. Thus the clustering C of 
HDRAN is high and increases with the dimension d and approaches a limit of 1 
when d is large. In Fig. 3, we present simulation results for C vs the dimension d 
for various values of the N, the network order. For d = 2,3, 4, 5, the simulation 
results provide values of C of 0.768, 0.825, 0.855 and 0.877, respectively. We see 
that the small difference between analytical and simulation values decreases 
with d. This difference might be explained because the continuum approach 
does not properly account all the vertices, affecting P(k) which is used in Eq. 
(9) to obtain C, see [42]. 
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4-3 Average path length 



The average path length of a network (APL) is defined as the number of edges 
in the shortest path between two vertices averaged over all pairs of vertices. 
Below, using the same approach than in [36] we will discuss the APL of a 
high dimensional random Apollonian network. 

First we note that it is not very difficult to prove that for a HDRAN and for 
any two arbitrary vertices % and j each shortest path from i to j does not pass 
through any vertex k satisfying that k < max(i, j). 

If d(i,j) denotes the distance between % and j, we introduce the total distance 
of a HDRAN with order N as a(N) 

a(N)= £ d(i,j) (10) 

l<i<j<N 



and we denote the APL by L(N), defined as: 

L(N)= (11! 



According to the former remark, the addition of new vertices will not affect 
the distance between those already existing and we have: 

N 

<j(N + l) = a(N) + J2d(i,N + l) (12) 

i 



Assume that the vertex iV + 1 is added joining the (d+ l)-clique K composed 
of vertices Wi, w 2 , • • • , Wj, Wd+i then Eq. (12) can be rewritten as: 



N N 



a(N + 1) = a(N) + J2( D (h w) + l) = a(N) + N + £ D(i, w) (13) 



where D(i, y) = mm{d(i, wi), d(i, w 2 ), • • • , d(i, w^), d(i, wa+i}- Constricting the 
clique K continuously into a single vertex w (here we assume that w = Wi), 
we have D(i, w) = d(i, w). Since d(wi, w) = d(w 2 , w) = ■ ■ ■ = d(wa+i, w) = 0, 
Eq. (13) can be rewritten as: 

a(N +1) = a(N) + N + Y, d (^ w ) ( 14 ) 
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where T = {1, 2, • • • , iV} — {w±, W2, • • • , Wd, Wd+i} is a vertex set with cardinal- 
ity N — d — 1. The sum J2ier d{i,w) can be considered as the total distance 
from one vertex w to all the other vertices in our model with order N — d and 
it can be roughly approximated in terms of L(N — d): 

Y,d(i,w) « (N -d-l)L(N - d) (15) 
Note that, as L(N) increases monotonously with N, it is clear that: 

(s-*-w»-*> = *g^<?T da) 

From equations (14), (15) and (16), we can obtain the expression: 

a(N + 1) < a(N) + N+ (17) 

Considering Eq. (17) as an equation and not an inequality, we can provide an 
upper bound for the variation of cr(N) as 

Which gives 

a(N) = N 2 \nN + a, (19) 

where a is a constant. As c(N) ~ A^ 2 lnA^, we have L(N) ~ lnA^. Note that 
as we have deduced equation (19) from an inequality, then L(N) increases at 
most as lnA^ with N. 

In Fig. 4, we report the simulation results on the average path length L vs 
the dimension d for various values of the network order N, which agree well 
with the analytic results. Note that the APL decreases with the dimension d 
as the networks become more dense. The dependence of the APL on d, comes 
from the method used to obtain Eq. (14) as the sum there is on a set T with 
cardinality N — d — 1 . 



5 Conclusion and discussion 



In summary, we have introduced an iterative algorithm for the construction 
of high-dimensional random Apollonian networks associated with high dimen- 
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Fig. 4. Semilogarithmic plot of the average path length L of a HDRAN vs its network 
size N for different values of the dimension d. The squares, stars, circles and triangles 
denote the simulation results for d = 2, d = 3, d = A and d = 5, respectively. The 
lines are fits to the simulated data. One can see that L grows at most as lniV. 

sional random packings. The networks have the typical characteristics of many 
technological and social networks and are small-world with power-law degree 
distributions, thus they can model a variety of scale-free networks of the real- 
life world. We have computed the analytical expressions for the degree dis- 
tribution and clustering coefficient and we show that they depend on the di- 
mension. Also, the average path length of the networks grows logarithmically 
with the order of the HDRAN. Moreover, HDRAN are a generalization of the 
maximal planar network introduced in [39]. Future work should include study- 
ing in detail processes such as percolation, spreading, searching and diffusion 
taking place on HDRANs. 
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